Picture for Weizhu Chen

Weizhu Chen

Test-time Recursive Thinking: Self-Improvement without External Feedback

Add code
Feb 03, 2026
Viaarxiv icon

Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability

Add code
Feb 02, 2026
Viaarxiv icon

RE-TRAC: REcursive TRAjectory Compression for Deep Search Agents

Add code
Feb 02, 2026
Viaarxiv icon

RLBR: Reinforcement Learning with Biasing Rewards for Contextual Speech Large Language Models

Add code
Jan 19, 2026
Viaarxiv icon

Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation

Add code
Jul 09, 2025
Figure 1 for Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation
Figure 2 for Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation
Figure 3 for Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation
Figure 4 for Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation
Viaarxiv icon

SwS: Self-aware Weakness-driven Problem Synthesis in Reinforcement Learning for LLM Reasoning

Add code
Jun 10, 2025
Viaarxiv icon

R&D-Agent: Automating Data-Driven AI Solution Building Through LLM-Powered Automated Research, Development, and Evolution

Add code
May 20, 2025
Viaarxiv icon

Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math

Add code
Apr 30, 2025
Figure 1 for Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math
Figure 2 for Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math
Figure 3 for Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math
Figure 4 for Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math
Viaarxiv icon

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Add code
Apr 29, 2025
Figure 1 for Reinforcement Learning for Reasoning in Large Language Models with One Training Example
Figure 2 for Reinforcement Learning for Reasoning in Large Language Models with One Training Example
Figure 3 for Reinforcement Learning for Reasoning in Large Language Models with One Training Example
Figure 4 for Reinforcement Learning for Reasoning in Large Language Models with One Training Example
Viaarxiv icon

Scaling Laws of Synthetic Data for Language Models

Add code
Mar 26, 2025
Figure 1 for Scaling Laws of Synthetic Data for Language Models
Figure 2 for Scaling Laws of Synthetic Data for Language Models
Figure 3 for Scaling Laws of Synthetic Data for Language Models
Figure 4 for Scaling Laws of Synthetic Data for Language Models
Viaarxiv icon